Python Program

– To Identify Silent Mutations for the Introduction of Restriction Sites in Open Reading Frames

Project Team

Associate Professor Anuradha Mathrani

School of Mathematical and Computational Sciences
Massey University
Auckland, New Zealand
Email: a.s.mathrani@massey.ac.nz
Web: My Profile

Associate Professor Evelyn Sattlegger

School of Food Technology and Natural Sciences
Maurice Wilkins Centre for Molecular BioDiscovery
Massey University
Palmerston North, New Zealand
Email: e.sattlegger@massey.ac.nz
Web: My profile

Our Project

For many applications in molecular biology, restriction sites need to be engineered into an open reading frame (ORF), a part of the genetic material that codes for a protein. Importantly, silent mutations need to be performed because only these do not alter the amino acid sequence of the protein. However, finding such silent mutations is a very time-consuming process. We have developed a program which recognizes all silent mutations in an open reading frame (ORF) that each lead to a new restriction site. Our program uses python technologies comprising web crawlers and data analysis libraries to deduce the amino acid sequence coded by an ORF and convert the ORF nucleotide sequence into the amino acid single letter sequence. In doing so, reverse translation back into the nucleotide sequence allows the consideration of all possible nucleotide sequences coding for the same amino acid sequence, which are then compared with the restriction recognition sites of commercially available restriction enzymes, such as from New England Biolabs (e.g., https://www.neb.com/). This allows the identification of restriction sites that can be engineered via silent mutations within the provided DNA input sequences.

Download the test application

The downloadable project .ZIP file bundlecontains 8 individual files, which are described below:

1. CheckRestrictionEnzyme.py
This is the source code in Python which user can download and compile themselves
2. CheckRestrictionEnzyme.exe
This is the Executable file on a Microsoft Windows operating system
3. TestDNA.txt
The sample DNA sequence text that can be inputted in the program
4. NEBEnzymeList.csv
This file contains the Enzyme data extracted from New England BioLabs (https://www.nebiolabs.co.nz/tools-and-resources/usage-guidelines/nebuffer-performance-chart-with-restriction-enzymes)
5. TemplateEnzymeList.csv
Template provided to user if they want to upload their own list
6. Instructions.doc
Brief user instructions are provided here
7. Detailed Instructions.doc
Flowchart with detailed instructions

Download the project .ZIP file

file size 63.6 MB